Combining sequence and structure information in protein alignments

Authors

  • Abel Rodriguez
  • Scott C. Schmidler
Abstract

For distantly related proteins, alignments based on structural information are more reliable than traditional sequence alignments. However, when structural comparison leaves some ambiguity in the alignment, sequence information can provide valuable additional information to discriminate between multiple alternatives. In this paper we present a Bayesian model that incorporates sequence information into structural alignments in an automatic and adaptive fashion. Using an estimated measure of conservation between sequence and structure, we construct an ensemble sequence/structure alignment tool capable of building refined protein alignments and identifying conserved functional regions. This model also provides a natural tool for using structural alignment to determine evolutionary distance, which may allow phylogenetic analysis of proteins at significantly larger divergence times. Two examples are presented and compared with previous analysis in the

Similar resources

3DCoffee: combining protein sequences and structures within multiple sequence alignments.

Most bioinformatics analyses require the assembly of a multiple sequence alignment. It has long been suspected that structural information can help to improve the quality of these alignments, yet the effect of combining sequences and structures has not been evaluated systematically. We developed 3DCoffee, a novel method for combining protein sequences and structures in order to generate high-qu...

Combining the GOR V algorithm with evolutionary information for protein secondary structure prediction from amino acid sequence.

We have modified and improved the GOR algorithm for the protein secondary structure prediction by using the evolutionary information provided by multiple sequence alignments, adding triplet statistics, and optimizing various parameters. We have expanded the database used to include the 513 non-redundant domains collected recently by Cuff and Barton (Proteins 1999;34:508-519; Proteins 2000;40:50...

Combining evolutionary information and neural networks to predict protein secondary structure.

Using evolutionary information contained in multiple sequence alignments as input to neural networks, secondary structure can be predicted at significantly increased accuracy. Here, we extend our previous three-level system of neural networks by using additional input information derived from multiple alignments. Using a position-specific conservation weight as part of the input increases perfo...

COMBOSA3D: combining sequence alignments with three-dimensional structures

COMBOSA3D is a program that allows sequence conservation to be viewed in its proper three-dimensional environment. It superimposes sequence alignment information onto a protein structure using a customizable color scheme, which is also applied to a textual sequence alignment for reference. Availability: The program can be tested at http://www.bioinformatics.org/combosa3d/, and the s...

Iterative sequence/secondary structure search for protein homologs: comparison with amino acid sequence alignments and application to fold recognition in genome databases

Motivation: Sequence alignment techniques have been developed into extremely powerful tools for identifying the folding families and function of proteins in newly sequenced genomes. For a sufficiently low sequence identity it is necessary to incorporate additional structural information to positively detect homologous proteins. We have carried out an extensive analysis of the effectiveness of in...

Publication date: 2006